A note on obtaining correct marginal predictions from a random intercepts model for binary outcomes.
نویسندگان
چکیده
BACKGROUND Clustered data with binary outcomes are often analysed using random intercepts models or generalised estimating equations (GEE) resulting in cluster-specific or 'population-average' inference, respectively. METHODS When a random effects model is fitted to clustered data, predictions may be produced for a member of an existing cluster by using estimates of the fixed effects (regression coefficients) and the random effect for the cluster (conditional risk calculation), or for a member of a new cluster (marginal risk calculation). We focus on the second. Marginal risk calculation from a random effects model is obtained by integrating over the distribution of random effects. However, in practice marginal risks are often obtained, incorrectly, using only estimates of the fixed effects (i.e. by effectively setting the random effects to zero). We compare these two approaches to marginal risk calculation in terms of model calibration. RESULTS In simulation studies, it has been seen that use of the incorrect marginal risk calculation from random effects models results in poorly calibrated overall marginal predictions (calibration slope <1 and calibration in the large ≠ 0) with mis-calibration becoming worse with higher degrees of clustering. We clarify that this was due to the incorrect calculation of marginal predictions from a random intercepts model and explain intuitively why this approach is incorrect. We show via simulation that the correct calculation of marginal risks from a random intercepts model results in predictions with excellent calibration. CONCLUSION The logistic random intercepts model can be used to obtain valid marginal predictions by integrating over the distribution of random effects.
منابع مشابه
A generalized linear mixed model for longitudinal binary data with a marginal logit link function.
Longitudinal studies of a binary outcome are common in the health, social, and behavioral sciences. In general, a feature of random effects logistic regression models for longitudinal binary data is that the marginal functional form, when integrated over the distribution of the random effects, is no longer of logistic form. Recently, Wang and Louis (2003) proposed a random intercept model in th...
متن کامل182-2007: Marginal Interpretation of Subject-Specific Curves: Logistic-Normal Regression
We propose the percentile curves concept as conditional probabilities curves across representative percentiles of the distribution of curves induced by random effects in a logistic model with random intercepts. We extend this concept to a logistic model with random intercepts and slopes and propose a methodology to approximate the percentile curves using the Monte-Carlo technique. We apply this...
متن کاملEnhanced Predictions of Tides and Surges through Data Assimilation (TECHNICAL NOTE)
The regional waters in Singapore Strait are characterized by complex hydrodynamic phenomena as a result of the combined effect of three large water bodies viz. the South China Sea, the Andaman Sea, and the Java Sea. This leads to anomalies in water levels and generates residual currents. Numerical hydrodynamic models are generally used for predicting water levels in the ocean and seas. But thei...
متن کاملCorrelations and Predictions of THF + 2-Alkanol Binary Mixtures Behaviour by PC-SAFT Model and Friction Theory
In this article the behavior of tetrahydrofuran (THF) + 2-alkanol namely 2-propanol, 2-butanol, 2-pentanol, 2-hexanol and 2-heptanol binary mixtures through the density and viscosity measurements have been studied as a function of composition and within the temperature range of 293.15–313.15 K. The excess molar volume, isobaric thermal expansivity, partial molar volumes, and viscosity deviation...
متن کاملAssessing discriminatory ability of random effects logistic models for clustered binary outcomes
In multicentre studies patients are typically clustered within centres and are likely to be correlated. Typically, random effects logistic models are fitted to clustered binary outcomes. However, limited work has been done to assess the discriminatory ability of these models: the ability of the model to distinguish between low-and high-risk patients. The C-index has been used to assess discrimi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- BMC medical research methodology
دوره 15 شماره
صفحات -
تاریخ انتشار 2015